Constrained Parameter Estimation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Audio Signals
Abstract
This paper describes a sound source separation method for polyphonic sound mixtures of music including both harmonic and inharmonic sounds, and constrained parameter estimation using standard MIDI files as prior information. The difficulties in dealing with both types of sound together have not been addressed in most previous methods that have focused on either of the two types separately, because the properties of these sounds are quite different. We therefore developed an integrated weighted-mixture model consisting of both harmonic-structure and inharmonic tone models. On the basis of the MAP estimation using the EM algorithm, we estimated all model parameters of this integrated model under several original constraints for preventing over-training and maintaining intra-instrument consistency. We confirmed that the integrated model increased the SNR by 1.5 dB.
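The reported 1.5 dB gain is a difference between two signal-to-noise ratios. The abstract does not say how SNR was computed, so the following is only a minimal sketch that assumes the common time-domain definition (reference power over residual power, in dB). The function name `snr_db` and the toy signals are hypothetical and are not taken from the paper.

```python
import math

def snr_db(reference, estimate):
    """SNR in dB of a separated signal against its reference.

    Assumes the common definition 10*log10(sum(r^2) / sum((r - e)^2));
    the paper's exact evaluation protocol is not given in the abstract.
    """
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_power = sum(r * r for r in reference)
    noise_power = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if noise_power == 0.0:
        return math.inf       # perfect reconstruction
    if signal_power == 0.0:
        return -math.inf      # silent reference, non-zero residual
    return 10.0 * math.log10(signal_power / noise_power)

# Toy data, made up for illustration only (not results from the paper).
reference = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
baseline = [0.80 * r for r in reference]  # hypothetical output of one model
improved = [0.85 * r for r in reference]  # hypothetical output of another

# An "increase in SNR" is the difference of the two values in dB.
gain_db = snr_db(reference, improved) - snr_db(reference, baseline)
print(f"SNR gain: {gain_db:.2f} dB")
```

Comparing two separation methods on the same reference signal this way gives a single dB figure of the kind quoted in the abstract.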
Similar resources
Source Separation of Musical Instrument Sounds in Polyphonic Musical Audio Signal and Its Application
A change of music appreciation style from “listening to high fidelity (Hi-Fi) sounds” to “listening to preferred sounds” has emerged due to the evolution of digital audio processing technology over the past years. Previously, many people enjoyed passive music appreciation: e.g., they buy CDs and phonograph recordings or download mp3 audio files, set the disks or files to various media players, and hi...
Polyphonic Pitch Tracking Using Joint Bayesian Estimation of Multiple Frame Parameters
We present a novel approach to pitch estimation and note detection in polyphonic audio signals. We pose the problem in a Bayesian probabilistic framework, which allows us to incorporate prior knowledge about the nature of musical data into the model. We exploit the high correlation between model parameters in adjacent frames of data by explicitly modelling the frequency variation over time usin...
PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
This paper describes a real-time method, called PreFEst (Predominant-F0 Estimation method), for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Without assuming the number of sound sources, PreFEst can estimate the relative dominance of every possible harmonic structure in the input mixture. It treats the mixture as if it contains all possi...
A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
In this paper I introduce a method, called PreFEst, for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Most previous F0-estimation methods have had difficulty dealing with such complex audio signals because these methods were designed to deal with mixtures of only a few sounds. Without assuming the number of sound sources, PreFEst can esti...
Harmonic Temporal Structured Clustering for Multiple Fundamental Frequency Estimation
This abstract describes a method for the Multiple Fundamental Frequency (F0) Estimation and Tracking task in the Music Information Retrieval Evaluation eXchange (MIREX) 2007. The method is called Harmonic Temporal Structured Clustering (HTC), which is a kind of constrained Gaussian Mixture Model (GMM) estimation using EM algorithm. It can jointly extract F0, intensity, onset, duration of each u...
Publication date: 2010